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ABSTRACT 


This paper presents a preliminary study related to the detection and 
identification of cardiac sounds components including first sound (S1), 
second sound (S2) and murmurs. Detection and identification of cardiac 
sounds are an important process in automated cardiac sound analysis system 
in order to automatically diagnose people who are having cardiovascular 
disorder and determine the existence of murmurs. Sixteen of recorded cardiac 
sounds (eight normal cardiac sounds, four abnormal cardiac sounds with 
systole murmur, and four abnormal cardiac sounds with diastole murmur) 
from PASCAL Classifying Heart Sounds Challenge database were examined 
for analysis. This work is significant in studying the time and time-frequency 
based detection of cardiac sounds components characteristics. In time-based 
analysis, envelope of signal energy was used to do the peak detection of S1, 
S2 and murmur and also analysis of cardiac cycle, systole and diastole 
duration. While time-frequency based analysis was used to determine the S1, 


Phonocardiogram S2 and murmur frequency range. The findings yield the overall accuracy of 
envelope-based detection for normal cardiac sound signal at 60.85% while 
for abnormal cardiac sound signal at 57.24%. 
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1, INTRODUCTION 

Cardiac murmurs are abnormal cardiac sound during heartbeat cycle. They may differ in terms of 
timing, location, intensity, quality and positional changes [1-2]. They are produced from valve defects that 
can conventionally be heard with an acoustic stethoscope. They are rasping sound due to the generation of 
vibration caused by the abnormal movement of blood within or between cardiac chambers, or by turbulent 
flow within the pulmonary artery or aorta just distal to the outflow valve [2]. Towards modern technology, 
cardiac sounds including first sound (S1), second sound (S2) and murmurs can be heard and visualized via 
digital stethoscope which is known as phonocardiogram (PCG). To be noted here, S1 1s defined as “Lub’ 
caused by the tricuspid and mitral (bicuspid) valve, while S2 is defined as “Dub” caused by the aortic and 
pulmonary valve [2]. If a murmur exists in systole which is in cardiac ventricle contraction mode, between 
the SI and S2 cardiac sounds, it is known as a “systolic murmur.” If it exists in diastole, which is in cardiac 
ventricle relaxation mode (between S2 and S1), it is known as a “diastolic murmur” [2]. 

In recent years, the detection of S1, S2 and murmurs can be realized automatically through signal 
processing approach and mathematical algorithm [3-5]. It is an essential process in pre-processing stage 
before proceeding to the feature extraction and classification stage for automated cardiac sound analysis 
system as shown in Figure | [3]. Initially, a physician captures and records cardiac sounds using an electronic 
stethoscope which is embedded with the amplifier and filter circuit. Then, the cardiac sound signal is 
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converted into digital signals that can be read by the computer. Next, the cardiac sound signal is sampled, 
segmented, filtered and analyzed with the sampling frequency in signal pre-processing process. The last 
process of automated cardiac sound analysis system is signal post-processing and analysis which consists of 
feature extraction and classification techniques. 
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Figure 1. A general block diagram of automated cardiac sound analysis system [3] 


A considerable amount of literature has already been published on envelope-based detection of S1, 
S2 and murmur, which used envelope of Shannon energy to determine the distinct peaks either SI or S2 in 
time series [6-12]. Most of them used Shannon energy in envelope-based approach due to its advantages can 
increase the medium intensity signal and attenuates the result of low intensity signal to the high intensity 
signal [13]. However, it suffers a lot of faulty detections when applying the threshold to detect the peak 
signal [14]. Authors in [6] claimed that Sl and S2 were detected when using low pass finite impulse response 
(FIR) filter with 100 Hz of cut-off frequency, but it will cause the elimination of murmur. Then, [7-8] articles 
are only limited to analysis of SI and S2 duration without analysis of systole and diastole duration. On the 
other hand, authors in [7, 9, 10-11] focused on the detection of SI and S2 of cardiac sound signal without 
murmur detection. Otherwise, authors in [9-10, 12] used wavelet transform for denoising of cardiac sound 
signal in SI and S2 detection. The difficulty of that approach is to adjust a few thresholds for reconstruction 
of the cardiac sound signal [14]. Moreover, other study used external reference such as Electrocardiogram 
(ECG) signal for detection of SI and S2 in PCG signal [15]. 

Due to the aforementioned constraints, the aim of this paper is to study cardiac sound signal 
components that include Sl, S2 and murmur characteristics detection in time and time-frequency (T-F) 
domain analysis. A squared energy envelope is proposed in this paper because of its better performance 
compared to Shannon energy envelope-based approach with the peak conditioning for time domain analysis 
[16]. While Fourier synchro squeezed transform is used for time-frequency domain analysis. So, the paper is 
organized as follows: Section 2 discusses the methods or steps of research work. Next, Section 3 presents the 
results and briefly discusses the findings and finally Section 4 1s devoted to conclusions. 


2. RESEARCH METHOD 

In this study, eight audios of normal and abnormal cardiac sounds from PASCAL Classifying Heart 
Sounds Challenge [17] were used for analysis in time and time-frequency (T-F) domain, respectively. 
The abnormal cardiac sounds consist of four cardiac sounds with systole murmur and other four cardiac 
sounds with diastole murmur. The steps of research work for time domain analysis are illustrated in Figure 2 
by firstly reading the audio file to get raw data and frequency sampling, fs. Since the frequency sampling of 
cardiac sound signal is 44100 Hz which leads to the computational burden, so, the signal was down-sampled 
into 2000 Hz [18]. Next, bandpass Butterworth filter with five order and cut off frequency between 25 and 
250 Hz were chosen due to frequency range of SI and S2 events [3]. Then, the signal, x(1) was normalized to 
ensure the peak signal is more prominent as shown in (1); 


x(i) 


Xnorm (i) — max(|x(i)|) 


(1) 
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where x(i) 1s the original signal and xnorm (i) is normalized signal. After that, signal energy, s(i) of 
normalized signal was calculated as shown in (2). This is known as squared energy, which is chosen in this 
research work since the intensity of Sl and S2 are significantly different to be identified with linear threshold 
for peak conditioning [16]. 


N 
5O= ) Cum OY 
i=1 
(2) 


Signal energy was enveloped to find energy peak of SI, S2 and murmur. All the stationary noise 
signal energy, which is less than 0.03 was removed in smoothing process since S1 and S2 energy peak are 
basically greater than that value [19]. The reason is to reduce false detection of cardiac sound signal. In this 
paper, upper and lower threshold were pre-defined, where the upper threshold, up was firstly set to 70% of 
maximum peak while lower threshold, lo, was set to 0% of maximum peak for normal cardiac sound signal 
case. But, for abnormal cardiac sound signal case, the lower threshold was pre-defined as 30% of maximum 
peak. Then, these thresholds were manually adjusted based on the current peak energy signal of S1, S2 and 
murmur for different recorded cardiac sound file as illustrated in Table 1. If the energy peak signal 1s greater 
than upper threshold, S1 was detected, while if energy peak signal is greater than lower threshold and lower 
than upper threshold, then, S2 was detected. Otherwise, murmur was detected if the peak energy is lower 
than lower threshold. 
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Figure 2. Flowchart of S1, S2 and murmur cardiac sounds detection using energy 
envelope-based detection 


For T-F domain analysis, the filtered and normalized cardiac sound signal was analyzed using 
Fourier synchro squeezed transform, see Figure 3. The advantage of this approach is the time-frequency of 
signal event information is known and time-scale representation is sharpened. This was realized using fsst 
function in Matlab. The time series of cardiac sound signal was transformed into short-time Fourier transform 
with hanning window. The transform values were then “squeezed” so that they concentrate around curves of 
instantaneous frequency in the time-frequency plane [20]. The resulting synchrosqueezed transform was 
shown in (3). 
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Tf (ta) = {| Veflems (wy f(tm) dn 


(3) 


where is short-time Fourier transform of a function, f, using a spectral hanning window, g with 0.13s length 
and is instantaneous frequencies that are estimated with the phase transform [20]. 


Table 1. Threshold Variation based on Different Cardiac Sound Types 
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Figure 3. Flowchart of S1, S2 and murmur cardiac sounds detection using Fourier 
synchrosqueezed transform 


3. RESULTS AND ANALYSIS 

Figure 4 (a) shows the peak detection of SI and S2 in normal cardiac sound signal based on the 
envelope of normalized signal energy for one of cardiac sound data from PASCAL database. While the peak 
detection of abnormal cardiac sound with systole and diastole murmur in time domain are shown in 
Figure 4(b) and Figure 4 (c), respectively. From Figure 4(a), Figure 4(b) and Figure 4(c), the red circle 
indicates Sl peak energy level if the peak is greater than upper threshold while the black circle refers to S2 
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peak energy level, if it is lower than upper threshold. Then, the blue circle indicates murmur peak energy 
level if the peak lower is than lower threshold. Based on time domain analysis, the murmur detection is a 
challenging task to do using peak conditioning approach when its energy level is approximately similar to the 
noise energy level. On the other hand, murmur energy peak might be vanished during smoothing process that 
will cause incorrect detection. Since there is approximately similar energy peak between noise and murmur 
energy, it is difficult to be determined in time series. For validation of murmur detection, it is analyzed using 
Fourier synchro squeezed transform as shown in Figure 4(d)-(f). For normal cardiac sound signal case, it is 
illustrated in Figure 4(d). As for abnormal cardiac sound signal case, it is illustrated in Figure 4(e) and Figure 
4(f), which are cardiac sound signal with systole and diastole murmur, respectively. 
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Figure 4. S1, S2 and murmur cardiac sounds detection using envelope of signal energy and Fourier 
synchro squeezed transform 
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It can be seen that SI, S2 and murmurs are clearly identified based on T-F domain. There are time 
and frequency information for S1, S2 and murmur events. The murmur events are identified by red dashed 
line. The different color includes dark blue, light blue and yellow denotes spectral power. The yellow color 
denotes the higher power, and the dark blue color denotes the lower power, and it can be seen clearly that the 
frequency content changes during the heartbeat cycle. S1 of cardiac sound signal produces higher spectral 
power compared to S2 for normal and abnormal cardiac sound signal. 

Table 2(a) shows the summary of duration of cardiac cycle, systole, and diastole in time domain 
analysis. Sl and S2 frequency range are also identified in T-F domain analysis for eight data of normal 
cardiac sound signal. While Table 2(b) shows the duration summary of cardiac cycle, systole, and diastole in 
time domain analysis for abnormal cardiac sound. It also tabulates S1, S2 and murmur frequency range in T- 
F domain analysis for eight data of abnormal cardiac sound signal. The average of cardiac cycle value for 
normal cardiac sound signal is 0.73s while the average value for abnormal cardiac cycle is 0.87s, which is 
much longer. This finding agreed with other study that claimed the cardiac cycle varied with the presence of 
cardiac murmurs [6]. Then, the average duration of diastole is longer than the duration of the systole for 
normal and abnormal cardiac sound signal, which are 0.33s and 0.39s respectively, which is also supported 
by [21]. To be noted here, the duration of diastole for abnormal cardiac sound signal is longer compared to 
normal cardiac sound signal with 0.06s of time difference due to the existence of murmurs. 

Since time domain analysis can only obtain the time information of cardiac sound signal event, 
the analysis of frequency range for $1, S2 and murmur had been obtained using Fourier synchro squeezed 
transform as mentioned in Table 2(a) and (b) for normal and abnormal cardiac sound signal respectively from 
manual observation. The average of frequency range of S1 for normal cardiac sound signal is between 29 and 
230 Hz while for abnormal cardiac sound signal is between 27 and 229 Hz. Next, S2 frequency range is 
between 35 and 195 Hz for normal cardiac sound signal while from 25 to 226 Hz is produced by abnormal 
cardiac sound signal. To be noted here, there are overlapping frequency among SI, S2 and murmur 
frequency, which the murmur frequency range is located between frequency range of S1 and S2 cardiac 
sound signal, with the average value is between 75 and 235 Hz for abnormal cardiac sound signal. 
This finding agreed with the theoretical study on the cardiac sound and murmur frequency range, which are 
between 24 Hz and 1024Hz as stated in [3, 22]. 

The evaluation of energy envelope-based detection performance was presented using confusion 
matrix as shown in Table 3 and Table 4 for normal and abnormal cardiac sound signal respectively. 
The classification of Sl, S2 and murmur class is based on input of maximum peak energy for each class 
using human observation on the labelled energy peak. From the confusion matrix tables, the diagonal 
elements of the matrix have been highlighted with blue color for both cases. The diagonal cells contain the 
number of correctly identified Sl, S2 and murmurs maximum peaks. So, the performance of energy 
envelope-based detection was measured based on the overall accuracy, OA, which is the ratio of total of 
diagonal cells to the number of samples, n as shown in (4). 


_ Leet cell; 


= (4) 


OA 


As mentioned in Table 3, the number of samples for normal cardiac sounds are 235, which are 
counted by the total number of SI and S2 from eight normal recorded sounds. The number of correctly 
detected S1 is 69 counts while the number of correctly detected S2 1s 74 counts. The fault detection of S1 is 
high because its energy peak is approximately similar to that of S2. Moreover, it is also contributed by S1 
split sound, that occurred by the adjournment of the tricuspid valve closure [23] or wider S2 split [24]. 
Conclusively, it is difficult to correctly detect either Sl or S2 peak using peak conditioning approach. 
The result exhibits the overall accuracy for detection of Sl and S2 of cardiac sound signal is 60.85% for 
normal case. For abnormal cardiac sound signal detection case as shown in Table 4, the overall accuracy 
produced is 57.24%, where S1 is 64 counts, S2 1s 56 counts and murmurs are 50 counts with 297 of samples. 
To be noted here, there is incorrect detection of murmur class when the energy peak signal is lower than 
lower threshold since the total number of samples is different with the prediction value. So, the result of 
abnormal cardiac sound detection is lower compared to that of normal cardiac sound. The reason is the 
detection process based on peak conditioning approach become a challenging task when dealing with the 
murmur. There is false detection for murmur detection especially in case of systole murmur as well as 
diastole murmur. This is because of murmur energy level is approximately similar to the noise energy level. 
In short, envelope-based detection approach suffered from misdetection issue when there are noises and peak 
variations [25]. 
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Table 2. Components of Normal and Abnormal Cardiac Sound Signal Characteristics in Time and Time- 
Frequency Domain Analysis 


(a) Normal Cardiac Sound Signal (b) Abnormal Cardiac Sound Signal 
Time Domain T-F Domain Time Domain T-F Domain 
Cardiac Sys. Dia. S1 freq. S2 freq. Cardiac Sys. Dia. S1 freq. S2 freq. Mur-mur 
cycle (s) — dur. (s) dur. (s) (Hz) (Hz) cycle (s) dur. (s) dur. (s) (Hz) (Hz) freq. (Hz) 
0.68 0.19 0.30 23-304 31-266 0.80 0.25 0.35 31-242 31-234 63-234 
0.61 0.16 0.27 39-250 31-226 0.84 0.22 0.35 23-219 23-250 70-226 
0.89 0.20 0.42 23-148 31-78 1.02 0.24 0.56 23-250 31-234 86-250 
0.88 0.26 0.55 31-258 46-219 0.80 0.28 0.35 31-234 31-218 63-250 
0.65 0.20 0.29 23-266 55-234 1.05 0.28 0.41 31-218 31-203 86-234 
0.68 0.10 0.25 31-180 39-188 0.76 0.23 0.40 23-226 31-234 70-234 
0.8 0.25 0.35 31-226 23-172 0.82 0.27 0.39 23-210 23-226 90-218 
0.66 0.11 0.22 31-211 23-180 0.86 0.22 0.33 23-234 23-210 70-234 
0.73 0.18 0.33 29-230 35-195 (0.87 0.25 0.39 27-229 25-226 75-235 
Table 3. Confusion matrix for normal cardiac Table 4. Confusion matrix for abnormal cardiac 
sound detection sound detection 
Predicted Predicted 
n= 235 Sl S2 Sum n= 297 Sl S2 Murmur Sum 
3 Sl 69 11 80 = Sl 64 7 0 Fi 
3 as S82 81 74 155 3 a S2 11 56 3 70 
Murmur 0 1 50 a1 
Sum 150 85 Sum fps, 64 53 
Overall Accuracy 60.85% Overall Accuracy 57.24% 


4. CONCLUSION 

The cardiac sound components characteristics and envelope-based detection approach have been 
investigated for normal and abnormal cardiac sound signals respectively. For cardiac sound components 
characteristics, the result shows that the average of cardiac cycle for abnormal cardiac sound was longer 
compared to normal cardiac sound due to the existence of murmur. Since time-based analysis only have the 
time information of $1, S2 and murmur event, Fourier synchrosqueezed transform has been used to identify 
the time-frequency information of S1, S2 and murmur events. While for enveloped-based peak detection of 
cardiac sounds include SI, S2 and murmur, the normal cardiac sound produced higher overall accuracy 
compared to the abnormal cardiac sound with 60.85% and 57.24%, respectively. This value of accuracy is 
acceptable with the number of samples. As number of samples increase, the accuracy value also will be 
increased unless there is less distorted data. Moreover, this study is only limited to the cardiac sounds’ 
envelope-based detection based on peak conditioning rule with manual threshold adjustment, thus, it is 
possible to do the adaptive approach and other methods like feature-based approach to improve the 
performance of automated cardiac sound analysis. 
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